Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 7905 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.7 MiB |
| Average record size in memory | 225.4 B |
Variable types
| Numeric | 13 |
|---|---|
| Categorical | 10 |
Age is highly overall correlated with Age_Years and 1 other fields | High correlation |
Age_Years is highly overall correlated with Age and 1 other fields | High correlation |
Ascites is highly overall correlated with Edema_N and 1 other fields | High correlation |
Bilirubin is highly overall correlated with Copper | High correlation |
Copper is highly overall correlated with Bilirubin | High correlation |
Diagnosis_Date is highly overall correlated with Age and 1 other fields | High correlation |
Edema_N is highly overall correlated with Ascites and 2 other fields | High correlation |
Edema_S is highly overall correlated with Edema_N | High correlation |
Edema_Y is highly overall correlated with Ascites and 1 other fields | High correlation |
Hepatomegaly is highly overall correlated with Stage | High correlation |
Stage is highly overall correlated with Hepatomegaly | High correlation |
Edema_N is highly imbalanced (55.0%) | Imbalance |
Edema_S is highly imbalanced (71.2%) | Imbalance |
Edema_Y is highly imbalanced (74.1%) | Imbalance |
Sex is highly imbalanced (62.7%) | Imbalance |
Ascites is highly imbalanced (72.2%) | Imbalance |
Reproduction
| Analysis started | 2024-02-12 06:31:20.440654 |
|---|---|
| Analysis finished | 2024-02-12 06:31:35.327910 |
| Duration | 14.89 seconds |
| Software version | ydata-profiling vv4.6.4 |
| Download configuration | config.json |
N_Days
Real number (ℝ)
| Distinct | 461 |
|---|---|
| Distinct (%) | 5.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2030.1733 |
| Minimum | 41 |
|---|---|
| Maximum | 4795 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 381.6 KiB |
Quantile statistics
| Minimum | 41 |
|---|---|
| 5-th percentile | 334 |
| Q1 | 1230 |
| median | 1831 |
| Q3 | 2689 |
| 95-th percentile | 4127 |
| Maximum | 4795 |
| Range | 4754 |
| Interquartile range (IQR) | 1459 |
Descriptive statistics
| Standard deviation | 1094.2337 |
|---|---|
| Coefficient of variation (CV) | 0.53898539 |
| Kurtosis | -0.49401726 |
| Mean | 2030.1733 |
| Median Absolute Deviation (MAD) | 724 |
| Skewness | 0.44865975 |
| Sum | 16048520 |
| Variance | 1197347.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1216 | 117 | 1.5% |
| 1434 | 105 | 1.3% |
| 769 | 83 | 1.0% |
| 3445 | 73 | 0.9% |
| 1765 | 64 | 0.8% |
| 1785 | 64 | 0.8% |
| 1363 | 60 | 0.8% |
| 904 | 59 | 0.7% |
| 334 | 58 | 0.7% |
| 2294 | 56 | 0.7% |
| Other values (451) | 7166 |
| Value | Count | Frequency (%) |
| 41 | 13 | |
| 51 | 16 | |
| 71 | 14 | |
| 76 | 1 | < 0.1% |
| 77 | 21 | |
| 78 | 1 | < 0.1% |
| 108 | 1 | < 0.1% |
| 110 | 25 | |
| 121 | 1 | < 0.1% |
| 124 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 4795 | 7 | 0.1% |
| 4556 | 51 | |
| 4523 | 15 | 0.2% |
| 4509 | 41 | |
| 4500 | 28 | |
| 4467 | 14 | 0.2% |
| 4459 | 19 | 0.2% |
| 4453 | 22 | |
| 4427 | 14 | 0.2% |
| 4392 | 1 | < 0.1% |
Age
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 391 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18373.146 |
| Minimum | 9598 |
|---|---|
| Maximum | 28650 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 381.6 KiB |
Quantile statistics
| Minimum | 9598 |
|---|---|
| 5-th percentile | 12307 |
| Q1 | 15574 |
| median | 18713 |
| Q3 | 20684 |
| 95-th percentile | 24622 |
| Maximum | 28650 |
| Range | 19052 |
| Interquartile range (IQR) | 5110 |
Descriptive statistics
| Standard deviation | 3679.9587 |
|---|---|
| Coefficient of variation (CV) | 0.20029007 |
| Kurtosis | -0.49738238 |
| Mean | 18373.146 |
| Median Absolute Deviation (MAD) | 2604 |
| Skewness | 0.084091298 |
| Sum | 1.4523972 × 108 |
| Variance | 13542096 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 22369 | 79 | 1.0% |
| 22388 | 71 | 0.9% |
| 20684 | 71 | 0.9% |
| 19060 | 70 | 0.9% |
| 16279 | 66 | 0.8% |
| 20459 | 65 | 0.8% |
| 19246 | 62 | 0.8% |
| 14161 | 62 | 0.8% |
| 22960 | 61 | 0.8% |
| 23331 | 61 | 0.8% |
| Other values (381) | 7237 |
| Value | Count | Frequency (%) |
| 9598 | 18 | |
| 10550 | 17 | |
| 10795 | 7 | 0.1% |
| 10810 | 1 | < 0.1% |
| 10958 | 1 | < 0.1% |
| 11058 | 33 | |
| 11167 | 10 | 0.1% |
| 11273 | 19 | |
| 11330 | 1 | < 0.1% |
| 11462 | 19 |
| Value | Count | Frequency (%) |
| 28650 | 36 | |
| 28018 | 5 | 0.1% |
| 27398 | 22 | |
| 27394 | 1 | < 0.1% |
| 27239 | 1 | < 0.1% |
| 27220 | 23 | |
| 26580 | 8 | 0.1% |
| 26567 | 1 | < 0.1% |
| 26259 | 13 | 0.2% |
| 25899 | 20 |
Bilirubin
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 111 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.5944845 |
| Minimum | 0.3 |
|---|---|
| Maximum | 28 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 381.6 KiB |
Quantile statistics
| Minimum | 0.3 |
|---|---|
| 5-th percentile | 0.5 |
| Q1 | 0.7 |
| median | 1.1 |
| Q3 | 3 |
| 95-th percentile | 11 |
| Maximum | 28 |
| Range | 27.7 |
| Interquartile range (IQR) | 2.3 |
Descriptive statistics
| Standard deviation | 3.8129603 |
|---|---|
| Coefficient of variation (CV) | 1.4696408 |
| Kurtosis | 12.908824 |
| Mean | 2.5944845 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 3.3396953 |
| Sum | 20509.4 |
| Variance | 14.538666 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.6 | 847 | 10.7% |
| 0.7 | 653 | 8.3% |
| 0.8 | 613 | 7.8% |
| 0.9 | 608 | 7.7% |
| 0.5 | 552 | 7.0% |
| 1.1 | 443 | 5.6% |
| 1.3 | 368 | 4.7% |
| 1 | 292 | 3.7% |
| 0.4 | 180 | 2.3% |
| 1.4 | 175 | 2.2% |
| Other values (101) | 3174 |
| Value | Count | Frequency (%) |
| 0.3 | 52 | 0.7% |
| 0.4 | 180 | 2.3% |
| 0.5 | 552 | |
| 0.6 | 847 | |
| 0.7 | 653 | |
| 0.8 | 613 | |
| 0.9 | 608 | |
| 1 | 292 | 3.7% |
| 1.1 | 443 | |
| 1.2 | 166 | 2.1% |
| Value | Count | Frequency (%) |
| 28 | 13 | |
| 25.5 | 13 | |
| 24.5 | 16 | |
| 22.5 | 16 | |
| 21.9 | 1 | < 0.1% |
| 21.6 | 19 | |
| 21.4 | 1 | < 0.1% |
| 20 | 4 | 0.1% |
| 18 | 4 | 0.1% |
| 17.9 | 9 |
Cholesterol
Real number (ℝ)
| Distinct | 226 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 350.56192 |
| Minimum | 120 |
|---|---|
| Maximum | 1775 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 381.6 KiB |
Quantile statistics
| Minimum | 120 |
|---|---|
| 5-th percentile | 198 |
| Q1 | 248 |
| median | 298 |
| Q3 | 390 |
| 95-th percentile | 646 |
| Maximum | 1775 |
| Range | 1655 |
| Interquartile range (IQR) | 142 |
Descriptive statistics
| Standard deviation | 195.37934 |
|---|---|
| Coefficient of variation (CV) | 0.5573319 |
| Kurtosis | 18.162327 |
| Mean | 350.56192 |
| Median Absolute Deviation (MAD) | 62 |
| Skewness | 3.6796575 |
| Sum | 2771192 |
| Variance | 38173.088 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 448 | 152 | 1.9% |
| 248 | 151 | 1.9% |
| 263 | 143 | 1.8% |
| 298 | 138 | 1.7% |
| 232 | 131 | 1.7% |
| 260 | 120 | 1.5% |
| 257 | 117 | 1.5% |
| 316 | 110 | 1.4% |
| 236 | 109 | 1.4% |
| 280 | 106 | 1.3% |
| Other values (216) | 6628 |
| Value | Count | Frequency (%) |
| 120 | 10 | 0.1% |
| 127 | 18 | 0.2% |
| 132 | 36 | |
| 134 | 1 | < 0.1% |
| 149 | 7 | 0.1% |
| 151 | 9 | 0.1% |
| 168 | 9 | 0.1% |
| 172 | 19 | 0.2% |
| 174 | 20 | 0.3% |
| 175 | 58 |
| Value | Count | Frequency (%) |
| 1775 | 11 | |
| 1712 | 19 | |
| 1600 | 22 | |
| 1492 | 1 | < 0.1% |
| 1480 | 11 | |
| 1436 | 1 | < 0.1% |
| 1336 | 9 | |
| 1276 | 21 | |
| 1236 | 1 | < 0.1% |
| 1128 | 14 |
Albumin
Real number (ℝ)
| Distinct | 160 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5483226 |
| Minimum | 1.96 |
|---|---|
| Maximum | 4.64 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 381.6 KiB |
Quantile statistics
| Minimum | 1.96 |
|---|---|
| 5-th percentile | 2.97 |
| Q1 | 3.35 |
| median | 3.58 |
| Q3 | 3.77 |
| 95-th percentile | 4.08 |
| Maximum | 4.64 |
| Range | 2.68 |
| Interquartile range (IQR) | 0.42 |
Descriptive statistics
| Standard deviation | 0.34617081 |
|---|---|
| Coefficient of variation (CV) | 0.097559002 |
| Kurtosis | 1.3396217 |
| Mean | 3.5483226 |
| Median Absolute Deviation (MAD) | 0.21 |
| Skewness | -0.5611495 |
| Sum | 28049.49 |
| Variance | 0.11983423 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.35 | 370 | 4.7% |
| 3.6 | 368 | 4.7% |
| 3.7 | 326 | 4.1% |
| 3.85 | 255 | 3.2% |
| 3.5 | 223 | 2.8% |
| 3.77 | 217 | 2.7% |
| 3.26 | 195 | 2.5% |
| 3.65 | 183 | 2.3% |
| 3.61 | 166 | 2.1% |
| 3.2 | 161 | 2.0% |
| Other values (150) | 5441 |
| Value | Count | Frequency (%) |
| 1.96 | 4 | 0.1% |
| 2.1 | 4 | 0.1% |
| 2.23 | 3 | < 0.1% |
| 2.27 | 4 | 0.1% |
| 2.31 | 4 | 0.1% |
| 2.33 | 16 | 0.2% |
| 2.35 | 1 | < 0.1% |
| 2.43 | 50 | |
| 2.52 | 1 | < 0.1% |
| 2.53 | 9 | 0.1% |
| Value | Count | Frequency (%) |
| 4.64 | 20 | |
| 4.52 | 5 | 0.1% |
| 4.4 | 14 | 0.2% |
| 4.38 | 24 | |
| 4.34 | 1 | < 0.1% |
| 4.31 | 1 | < 0.1% |
| 4.3 | 42 | |
| 4.26 | 1 | < 0.1% |
| 4.24 | 12 | 0.2% |
| 4.23 | 19 |
Copper
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 171 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 83.902846 |
| Minimum | 4 |
|---|---|
| Maximum | 588 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 381.6 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 14 |
| Q1 | 39 |
| median | 63 |
| Q3 | 102 |
| 95-th percentile | 231 |
| Maximum | 588 |
| Range | 584 |
| Interquartile range (IQR) | 63 |
Descriptive statistics
| Standard deviation | 75.899266 |
|---|---|
| Coefficient of variation (CV) | 0.90460895 |
| Kurtosis | 10.21299 |
| Mean | 83.902846 |
| Median Absolute Deviation (MAD) | 26 |
| Skewness | 2.7017358 |
| Sum | 663252 |
| Variance | 5760.6986 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 67 | 311 | 3.9% |
| 52 | 303 | 3.8% |
| 39 | 216 | 2.7% |
| 58 | 207 | 2.6% |
| 75 | 188 | 2.4% |
| 41 | 179 | 2.3% |
| 13 | 172 | 2.2% |
| 20 | 169 | 2.1% |
| 44 | 154 | 1.9% |
| 38 | 151 | 1.9% |
| Other values (161) | 5855 |
| Value | Count | Frequency (%) |
| 4 | 12 | 0.2% |
| 5 | 2 | < 0.1% |
| 9 | 53 | 0.7% |
| 10 | 25 | 0.3% |
| 11 | 60 | 0.8% |
| 12 | 36 | 0.5% |
| 13 | 172 | |
| 14 | 42 | 0.5% |
| 15 | 11 | 0.1% |
| 16 | 7 | 0.1% |
| Value | Count | Frequency (%) |
| 588 | 19 | |
| 558 | 7 | 0.1% |
| 464 | 26 | |
| 456 | 1 | < 0.1% |
| 444 | 21 | |
| 412 | 13 | 0.2% |
| 380 | 43 | |
| 358 | 21 | |
| 308 | 4 | 0.1% |
| 290 | 20 |
Alk_Phos
Real number (ℝ)
| Distinct | 364 |
|---|---|
| Distinct (%) | 4.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1816.7452 |
| Minimum | 289 |
|---|---|
| Maximum | 13862.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 381.6 KiB |
Quantile statistics
| Minimum | 289 |
|---|---|
| 5-th percentile | 614 |
| Q1 | 834 |
| median | 1181 |
| Q3 | 1857 |
| 95-th percentile | 6064.8 |
| Maximum | 13862.4 |
| Range | 13573.4 |
| Interquartile range (IQR) | 1023 |
Descriptive statistics
| Standard deviation | 1903.7507 |
|---|---|
| Coefficient of variation (CV) | 1.0478908 |
| Kurtosis | 11.59975 |
| Mean | 1816.7452 |
| Median Absolute Deviation (MAD) | 460 |
| Skewness | 3.1955577 |
| Sum | 14361371 |
| Variance | 3624266.6 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 663 | 117 | 1.5% |
| 1345 | 81 | 1.0% |
| 7277 | 79 | 1.0% |
| 944 | 78 | 1.0% |
| 794 | 76 | 1.0% |
| 645 | 76 | 1.0% |
| 1636 | 76 | 1.0% |
| 1052 | 75 | 0.9% |
| 2276 | 63 | 0.8% |
| 674 | 63 | 0.8% |
| Other values (354) | 7121 |
| Value | Count | Frequency (%) |
| 289 | 32 | |
| 310 | 10 | 0.1% |
| 369 | 21 | |
| 377 | 17 | |
| 414 | 8 | 0.1% |
| 423 | 31 | |
| 453 | 26 | |
| 466 | 16 | |
| 516 | 12 | 0.2% |
| 554 | 31 |
| Value | Count | Frequency (%) |
| 13862.4 | 15 | |
| 13486.2 | 1 | < 0.1% |
| 12258.8 | 26 | |
| 11552 | 11 | |
| 11320.2 | 15 | |
| 11046.6 | 12 | |
| 10795.4 | 1 | < 0.1% |
| 10396.8 | 22 | |
| 10165 | 11 | |
| 9933.2 | 3 | < 0.1% |
SGOT
Real number (ℝ)
| Distinct | 206 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 114.6046 |
| Minimum | 26.35 |
|---|---|
| Maximum | 457.25 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 381.6 KiB |
Quantile statistics
| Minimum | 26.35 |
|---|---|
| 5-th percentile | 54.25 |
| Q1 | 75.95 |
| median | 108.5 |
| Q3 | 137.95 |
| 95-th percentile | 198.4 |
| Maximum | 457.25 |
| Range | 430.9 |
| Interquartile range (IQR) | 62 |
Descriptive statistics
| Standard deviation | 48.790945 |
|---|---|
| Coefficient of variation (CV) | 0.42573286 |
| Kurtosis | 5.8167874 |
| Mean | 114.6046 |
| Median Absolute Deviation (MAD) | 31 |
| Skewness | 1.5348057 |
| Sum | 905949.38 |
| Variance | 2380.5563 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 71.3 | 256 | 3.2% |
| 57.35 | 247 | 3.1% |
| 137.95 | 206 | 2.6% |
| 120.9 | 198 | 2.5% |
| 97.65 | 189 | 2.4% |
| 170.5 | 184 | 2.3% |
| 93 | 178 | 2.3% |
| 128.65 | 170 | 2.2% |
| 66.65 | 138 | 1.7% |
| 106.95 | 137 | 1.7% |
| Other values (196) | 6002 |
| Value | Count | Frequency (%) |
| 26.35 | 8 | 0.1% |
| 28.38 | 12 | 0.2% |
| 40.6 | 1 | < 0.1% |
| 41.85 | 16 | 0.2% |
| 43.4 | 40 | |
| 45 | 14 | 0.2% |
| 46.5 | 6 | 0.1% |
| 49.6 | 52 | |
| 51.15 | 57 | |
| 52 | 15 | 0.2% |
| Value | Count | Frequency (%) |
| 457.25 | 17 | |
| 338 | 9 | |
| 328.6 | 15 | |
| 299.15 | 6 | 0.1% |
| 288 | 9 | |
| 280.55 | 15 | |
| 272.8 | 9 | |
| 260.15 | 1 | < 0.1% |
| 253 | 1 | < 0.1% |
| 246.45 | 13 |
Tryglicerides
Real number (ℝ)
| Distinct | 154 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 115.34016 |
| Minimum | 33 |
|---|---|
| Maximum | 598 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 381.6 KiB |
Quantile statistics
| Minimum | 33 |
|---|---|
| 5-th percentile | 56 |
| Q1 | 84 |
| median | 104 |
| Q3 | 139 |
| 95-th percentile | 210 |
| Maximum | 598 |
| Range | 565 |
| Interquartile range (IQR) | 55 |
Descriptive statistics
| Standard deviation | 52.530402 |
|---|---|
| Coefficient of variation (CV) | 0.45543894 |
| Kurtosis | 15.048118 |
| Mean | 115.34016 |
| Median Absolute Deviation (MAD) | 27 |
| Skewness | 2.6339208 |
| Sum | 911764 |
| Variance | 2759.4431 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 90 | 262 | 3.3% |
| 85 | 223 | 2.8% |
| 91 | 218 | 2.8% |
| 118 | 211 | 2.7% |
| 68 | 188 | 2.4% |
| 56 | 187 | 2.4% |
| 146 | 181 | 2.3% |
| 108 | 175 | 2.2% |
| 55 | 171 | 2.2% |
| 133 | 170 | 2.2% |
| Other values (144) | 5919 |
| Value | Count | Frequency (%) |
| 33 | 13 | 0.2% |
| 44 | 37 | 0.5% |
| 46 | 12 | 0.2% |
| 49 | 13 | 0.2% |
| 50 | 19 | 0.2% |
| 52 | 24 | 0.3% |
| 53 | 15 | 0.2% |
| 55 | 171 | |
| 56 | 187 | |
| 57 | 10 | 0.1% |
| Value | Count | Frequency (%) |
| 598 | 13 | |
| 432 | 16 | |
| 393 | 1 | < 0.1% |
| 382 | 4 | 0.1% |
| 322 | 5 | 0.1% |
| 319 | 15 | |
| 318 | 18 | |
| 309 | 20 | |
| 283 | 1 | < 0.1% |
| 280 | 20 |
Platelets
Real number (ℝ)
| Distinct | 227 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 265.22897 |
| Minimum | 62 |
|---|---|
| Maximum | 563 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 381.6 KiB |
Quantile statistics
| Minimum | 62 |
|---|---|
| 5-th percentile | 128 |
| Q1 | 211 |
| median | 265 |
| Q3 | 316 |
| 95-th percentile | 430 |
| Maximum | 563 |
| Range | 501 |
| Interquartile range (IQR) | 105 |
Descriptive statistics
| Standard deviation | 87.465579 |
|---|---|
| Coefficient of variation (CV) | 0.32977385 |
| Kurtosis | 0.33057783 |
| Mean | 265.22897 |
| Median Absolute Deviation (MAD) | 53 |
| Skewness | 0.42004793 |
| Sum | 2096635 |
| Variance | 7650.2274 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 344 | 233 | 2.9% |
| 228 | 159 | 2.0% |
| 268 | 158 | 2.0% |
| 295 | 154 | 1.9% |
| 336 | 147 | 1.9% |
| 251 | 144 | 1.8% |
| 265 | 138 | 1.7% |
| 269 | 136 | 1.7% |
| 213 | 136 | 1.7% |
| 309 | 132 | 1.7% |
| Other values (217) | 6368 |
| Value | Count | Frequency (%) |
| 62 | 11 | 0.1% |
| 65 | 1 | < 0.1% |
| 70 | 10 | 0.1% |
| 71 | 15 | 0.2% |
| 76 | 1 | < 0.1% |
| 79 | 18 | |
| 80 | 25 | |
| 81 | 11 | 0.1% |
| 88 | 3 | < 0.1% |
| 95 | 38 |
| Value | Count | Frequency (%) |
| 563 | 36 | |
| 539 | 5 | 0.1% |
| 518 | 14 | 0.2% |
| 515 | 2 | < 0.1% |
| 514 | 13 | 0.2% |
| 493 | 17 | |
| 487 | 10 | 0.1% |
| 474 | 17 | |
| 471 | 24 | |
| 467 | 40 |
Prothrombin
Real number (ℝ)
| Distinct | 49 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.629462 |
| Minimum | 9 |
|---|---|
| Maximum | 18 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 381.6 KiB |
Quantile statistics
| Minimum | 9 |
|---|---|
| 5-th percentile | 9.6 |
| Q1 | 10 |
| median | 10.6 |
| Q3 | 11 |
| 95-th percentile | 12 |
| Maximum | 18 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.78173483 |
|---|---|
| Coefficient of variation (CV) | 0.073544155 |
| Kurtosis | 4.288955 |
| Mean | 10.629462 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 1.292436 |
| Sum | 84025.9 |
| Variance | 0.61110934 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10.6 | 1070 | 13.5% |
| 11 | 842 | 10.7% |
| 10 | 638 | 8.1% |
| 9.9 | 517 | 6.5% |
| 9.8 | 440 | 5.6% |
| 10.1 | 390 | 4.9% |
| 10.9 | 339 | 4.3% |
| 11.5 | 295 | 3.7% |
| 9.6 | 288 | 3.6% |
| 10.2 | 283 | 3.6% |
| Other values (39) | 2803 |
| Value | Count | Frequency (%) |
| 9 | 8 | 0.1% |
| 9.1 | 9 | 0.1% |
| 9.2 | 5 | 0.1% |
| 9.3 | 8 | 0.1% |
| 9.4 | 17 | 0.2% |
| 9.5 | 137 | 1.7% |
| 9.6 | 288 | |
| 9.7 | 199 | 2.5% |
| 9.8 | 440 | |
| 9.9 | 517 |
| Value | Count | Frequency (%) |
| 18 | 1 | < 0.1% |
| 17.1 | 2 | < 0.1% |
| 15.2 | 12 | 0.2% |
| 14.1 | 4 | 0.1% |
| 13.6 | 9 | 0.1% |
| 13.4 | 1 | < 0.1% |
| 13.3 | 6 | 0.1% |
| 13.2 | 32 | |
| 13.1 | 1 | < 0.1% |
| 13 | 45 |
Status
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 381.6 KiB |
| 0 | |
|---|---|
| 2 | |
| 1 | 275 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 7905 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 0 |
| 3rd row | 2 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 4965 | |
| 2 | 2665 | |
| 1 | 275 | 3.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 4965 | |
| 2 | 2665 | |
| 1 | 275 | 3.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4965 | |
| 2 | 2665 | |
| 1 | 275 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7905 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4965 | |
| 2 | 2665 | |
| 1 | 275 | 3.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7905 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4965 | |
| 2 | 2665 | |
| 1 | 275 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7905 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4965 | |
| 2 | 2665 | |
| 1 | 275 | 3.5% |
Diagnosis_Date
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 4621 |
|---|---|
| Distinct (%) | 58.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16342.973 |
| Minimum | 5755 |
|---|---|
| Maximum | 28451 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 381.6 KiB |
Quantile statistics
| Minimum | 5755 |
|---|---|
| 5-th percentile | 10212 |
| Q1 | 13293 |
| median | 16320 |
| Q3 | 18947 |
| 95-th percentile | 23001.8 |
| Maximum | 28451 |
| Range | 22696 |
| Interquartile range (IQR) | 5654 |
Descriptive statistics
| Standard deviation | 3945.0917 |
|---|---|
| Coefficient of variation (CV) | 0.24139376 |
| Kurtosis | -0.44258475 |
| Mean | 16342.973 |
| Median Absolute Deviation (MAD) | 2843 |
| Skewness | 0.16929262 |
| Sum | 1.291912 × 108 |
| Variance | 15563749 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 22035 | 45 | 0.6% |
| 18634 | 36 | 0.5% |
| 18822 | 34 | 0.4% |
| 21484 | 33 | 0.4% |
| 17935 | 33 | 0.4% |
| 18291 | 30 | 0.4% |
| 26885 | 25 | 0.3% |
| 12715 | 24 | 0.3% |
| 12139 | 23 | 0.3% |
| 15255 | 21 | 0.3% |
| Other values (4611) | 7601 |
| Value | Count | Frequency (%) |
| 5755 | 1 | < 0.1% |
| 6511 | 1 | < 0.1% |
| 6591 | 2 | < 0.1% |
| 6611 | 1 | < 0.1% |
| 6626 | 1 | < 0.1% |
| 6711 | 1 | < 0.1% |
| 6727 | 7 | |
| 6983 | 1 | < 0.1% |
| 7142 | 1 | < 0.1% |
| 7235 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 28451 | 1 | < 0.1% |
| 27720 | 1 | < 0.1% |
| 27697 | 1 | < 0.1% |
| 27650 | 1 | < 0.1% |
| 27630 | 1 | < 0.1% |
| 27618 | 1 | < 0.1% |
| 27283 | 1 | < 0.1% |
| 27215 | 1 | < 0.1% |
| 27200 | 1 | < 0.1% |
| 26885 | 25 |
Age_Years
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 49 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.308033 |
| Minimum | 26 |
|---|---|
| Maximum | 78 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 381.6 KiB |
Quantile statistics
| Minimum | 26 |
|---|---|
| 5-th percentile | 34 |
| Q1 | 43 |
| median | 51 |
| Q3 | 57 |
| 95-th percentile | 67 |
| Maximum | 78 |
| Range | 52 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 10.085483 |
|---|---|
| Coefficient of variation (CV) | 0.20047461 |
| Kurtosis | -0.49383657 |
| Mean | 50.308033 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.082175664 |
| Sum | 397685 |
| Variance | 101.71697 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 56 | 549 | 6.9% |
| 53 | 401 | 5.1% |
| 61 | 388 | 4.9% |
| 52 | 340 | 4.3% |
| 50 | 338 | 4.3% |
| 41 | 335 | 4.2% |
| 51 | 292 | 3.7% |
| 46 | 275 | 3.5% |
| 57 | 274 | 3.5% |
| 49 | 256 | 3.2% |
| Other values (39) | 4457 |
| Value | Count | Frequency (%) |
| 26 | 18 | 0.2% |
| 29 | 17 | 0.2% |
| 30 | 42 | 0.5% |
| 31 | 68 | 0.9% |
| 32 | 54 | 0.7% |
| 33 | 142 | |
| 34 | 160 | |
| 35 | 182 | |
| 36 | 113 | |
| 37 | 128 |
| Value | Count | Frequency (%) |
| 78 | 36 | |
| 77 | 5 | 0.1% |
| 75 | 47 | |
| 73 | 9 | 0.1% |
| 72 | 13 | 0.2% |
| 71 | 64 | |
| 70 | 68 | |
| 69 | 40 | |
| 68 | 80 | |
| 67 | 85 |
Edema_N
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 381.6 KiB |
| 1.0 | |
|---|---|
| 0.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 23715 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 0.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 7161 | |
| 0.0 | 744 | 9.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 7161 | |
| 0.0 | 744 | 9.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8649 | |
| . | 7905 | |
| 1 | 7161 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15810 | |
| Other Punctuation | 7905 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8649 | |
| 1 | 7161 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7905 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23715 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8649 | |
| . | 7905 | |
| 1 | 7161 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23715 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8649 | |
| . | 7905 | |
| 1 | 7161 |
Edema_S
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 381.6 KiB |
| 0.0 | |
|---|---|
| 1.0 | 399 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 23715 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 7506 | |
| 1.0 | 399 | 5.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 7506 | |
| 1.0 | 399 | 5.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 15411 | |
| . | 7905 | |
| 1 | 399 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15810 | |
| Other Punctuation | 7905 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 15411 | |
| 1 | 399 | 2.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7905 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23715 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 15411 | |
| . | 7905 | |
| 1 | 399 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23715 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 15411 | |
| . | 7905 | |
| 1 | 399 | 1.7% |
Edema_Y
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 381.6 KiB |
| 0.0 | |
|---|---|
| 1.0 | 345 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 23715 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 1.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 7560 | |
| 1.0 | 345 | 4.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 7560 | |
| 1.0 | 345 | 4.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 15465 | |
| . | 7905 | |
| 1 | 345 | 1.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15810 | |
| Other Punctuation | 7905 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 15465 | |
| 1 | 345 | 2.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7905 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23715 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 15465 | |
| . | 7905 | |
| 1 | 345 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23715 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 15465 | |
| . | 7905 | |
| 1 | 345 | 1.5% |
Drug
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 381.6 KiB |
| 1.0 | |
|---|---|
| 0.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 23715 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 4010 | |
| 0.0 | 3895 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 4010 | |
| 0.0 | 3895 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 11800 | |
| . | 7905 | |
| 1 | 4010 | 16.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15810 | |
| Other Punctuation | 7905 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 11800 | |
| 1 | 4010 | 25.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7905 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23715 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 11800 | |
| . | 7905 | |
| 1 | 4010 | 16.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23715 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 11800 | |
| . | 7905 | |
| 1 | 4010 | 16.9% |
Sex
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 381.6 KiB |
| 0.0 | |
|---|---|
| 1.0 | 569 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 23715 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 7336 | |
| 1.0 | 569 | 7.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 7336 | |
| 1.0 | 569 | 7.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 15241 | |
| . | 7905 | |
| 1 | 569 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15810 | |
| Other Punctuation | 7905 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 15241 | |
| 1 | 569 | 3.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7905 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23715 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 15241 | |
| . | 7905 | |
| 1 | 569 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23715 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 15241 | |
| . | 7905 | |
| 1 | 569 | 2.4% |
Ascites
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 381.6 KiB |
| 0.0 | |
|---|---|
| 1.0 | 380 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 23715 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 7525 | |
| 1.0 | 380 | 4.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 7525 | |
| 1.0 | 380 | 4.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 15430 | |
| . | 7905 | |
| 1 | 380 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15810 | |
| Other Punctuation | 7905 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 15430 | |
| 1 | 380 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7905 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23715 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 15430 | |
| . | 7905 | |
| 1 | 380 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23715 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 15430 | |
| . | 7905 | |
| 1 | 380 | 1.6% |
Hepatomegaly
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 381.6 KiB |
| 1.0 | |
|---|---|
| 0.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 23715 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 1.0 |
| 4th row | 0.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 4042 | |
| 0.0 | 3863 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 4042 | |
| 0.0 | 3863 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 11768 | |
| . | 7905 | |
| 1 | 4042 | 17.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15810 | |
| Other Punctuation | 7905 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 11768 | |
| 1 | 4042 | 25.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7905 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23715 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 11768 | |
| . | 7905 | |
| 1 | 4042 | 17.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23715 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 11768 | |
| . | 7905 | |
| 1 | 4042 | 17.0% |
Spiders
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 381.6 KiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 23715 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 1.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 5966 | |
| 1.0 | 1939 | 24.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 5966 | |
| 1.0 | 1939 | 24.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 13871 | |
| . | 7905 | |
| 1 | 1939 | 8.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15810 | |
| Other Punctuation | 7905 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 13871 | |
| 1 | 1939 | 12.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7905 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23715 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 13871 | |
| . | 7905 | |
| 1 | 1939 | 8.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23715 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 13871 | |
| . | 7905 | |
| 1 | 1939 | 8.2% |
Stage
Categorical
HIGH CORRELATION 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 381.6 KiB |
| 2.0 | |
|---|---|
| 3.0 | |
| 1.0 | |
| 0.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 23715 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 3.0 |
| 4th row | 2.0 |
| 5th row | 3.0 |
Common Values
| Value | Count | Frequency (%) |
| 2.0 | 3153 | |
| 3.0 | 2703 | |
| 1.0 | 1652 | |
| 0.0 | 397 | 5.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2.0 | 3153 | |
| 3.0 | 2703 | |
| 1.0 | 1652 | |
| 0.0 | 397 | 5.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8302 | |
| . | 7905 | |
| 2 | 3153 | 13.3% |
| 3 | 2703 | 11.4% |
| 1 | 1652 | 7.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15810 | |
| Other Punctuation | 7905 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8302 | |
| 2 | 3153 | 19.9% |
| 3 | 2703 | 17.1% |
| 1 | 1652 | 10.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7905 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23715 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8302 | |
| . | 7905 | |
| 2 | 3153 | 13.3% |
| 3 | 2703 | 11.4% |
| 1 | 1652 | 7.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23715 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8302 | |
| . | 7905 | |
| 2 | 3153 | 13.3% |
| 3 | 2703 | 11.4% |
| 1 | 1652 | 7.0% |
| Age | Age_Years | Albumin | Alk_Phos | Ascites | Bilirubin | Cholesterol | Copper | Diagnosis_Date | Drug | Edema_N | Edema_S | Edema_Y | Hepatomegaly | N_Days | Platelets | Prothrombin | SGOT | Sex | Spiders | Stage | Status | Tryglicerides | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Age | 1.000 | 0.999 | -0.079 | -0.041 | 0.191 | 0.055 | -0.077 | 0.034 | 0.958 | 0.129 | 0.168 | 0.094 | 0.172 | 0.126 | -0.104 | -0.097 | 0.134 | -0.037 | 0.146 | 0.082 | 0.104 | 0.173 | 0.021 |
| Age_Years | 0.999 | 1.000 | -0.079 | -0.041 | 0.213 | 0.055 | -0.076 | 0.035 | 0.958 | 0.134 | 0.194 | 0.106 | 0.189 | 0.141 | -0.103 | -0.096 | 0.133 | -0.037 | 0.148 | 0.084 | 0.104 | 0.183 | 0.021 |
| Albumin | -0.079 | -0.079 | 1.000 | -0.167 | 0.443 | -0.304 | -0.054 | -0.236 | -0.136 | 0.105 | 0.347 | 0.089 | 0.435 | 0.268 | 0.241 | 0.125 | -0.167 | -0.220 | 0.060 | 0.230 | 0.153 | 0.223 | -0.113 |
| Alk_Phos | -0.041 | -0.041 | -0.167 | 1.000 | 0.122 | 0.333 | 0.320 | 0.281 | -0.001 | 0.053 | 0.129 | 0.099 | 0.126 | 0.210 | -0.147 | 0.052 | 0.092 | 0.427 | 0.036 | 0.099 | 0.063 | 0.171 | 0.195 |
| Ascites | 0.191 | 0.213 | 0.443 | 0.122 | 1.000 | 0.263 | -0.096 | 0.222 | 0.198 | 0.045 | 0.526 | 0.087 | 0.657 | 0.184 | -0.260 | -0.183 | 0.264 | 0.126 | 0.033 | 0.209 | 0.195 | 0.276 | 0.110 |
| Bilirubin | 0.055 | 0.055 | -0.304 | 0.333 | 0.263 | 1.000 | 0.325 | 0.586 | 0.155 | 0.084 | 0.341 | 0.160 | 0.390 | 0.335 | -0.405 | -0.168 | 0.268 | 0.499 | 0.106 | 0.321 | 0.137 | 0.348 | 0.316 |
| Cholesterol | -0.077 | -0.076 | -0.054 | 0.320 | -0.096 | 0.325 | 1.000 | 0.255 | -0.039 | 0.079 | 0.044 | 0.000 | 0.052 | 0.136 | -0.123 | 0.124 | -0.050 | 0.347 | 0.049 | 0.072 | 0.035 | 0.156 | 0.332 |
| Copper | 0.034 | 0.035 | -0.236 | 0.281 | 0.222 | 0.586 | 0.255 | 1.000 | 0.118 | 0.072 | 0.298 | 0.135 | 0.306 | 0.312 | -0.338 | -0.126 | 0.209 | 0.438 | 0.179 | 0.282 | 0.138 | 0.324 | 0.341 |
| Diagnosis_Date | 0.958 | 0.958 | -0.136 | -0.001 | 0.198 | 0.155 | -0.039 | 0.118 | 1.000 | 0.099 | 0.251 | 0.111 | 0.242 | 0.181 | -0.360 | -0.125 | 0.157 | 0.040 | 0.169 | 0.125 | 0.118 | 0.205 | 0.078 |
| Drug | 0.129 | 0.134 | 0.105 | 0.053 | 0.045 | 0.084 | 0.079 | 0.072 | 0.099 | 1.000 | 0.025 | 0.000 | 0.033 | 0.062 | 0.004 | 0.020 | 0.028 | 0.041 | 0.043 | 0.000 | 0.027 | 0.022 | 0.073 |
| Edema_N | 0.168 | 0.194 | 0.347 | 0.129 | 0.526 | 0.341 | 0.044 | 0.298 | 0.251 | 0.025 | 1.000 | 0.714 | 0.662 | 0.224 | 0.258 | 0.177 | -0.297 | -0.114 | 0.051 | 0.257 | 0.231 | 0.328 | -0.071 |
| Edema_S | 0.094 | 0.106 | 0.089 | 0.099 | 0.087 | 0.160 | 0.000 | 0.135 | 0.111 | 0.000 | 0.714 | 1.000 | 0.047 | 0.135 | -0.109 | -0.070 | 0.150 | 0.054 | 0.070 | 0.133 | 0.116 | 0.171 | 0.021 |
| Edema_Y | 0.172 | 0.189 | 0.435 | 0.126 | 0.657 | 0.390 | 0.052 | 0.306 | 0.242 | 0.033 | 0.662 | 0.047 | 1.000 | 0.174 | -0.252 | -0.178 | 0.264 | 0.105 | 0.000 | 0.223 | 0.206 | 0.286 | 0.079 |
| Hepatomegaly | 0.126 | 0.141 | 0.268 | 0.210 | 0.184 | 0.335 | 0.136 | 0.312 | 0.181 | 0.062 | 0.224 | 0.135 | 0.174 | 1.000 | -0.293 | -0.200 | 0.251 | 0.231 | 0.065 | 0.329 | 0.526 | 0.396 | 0.181 |
| N_Days | -0.104 | -0.103 | 0.241 | -0.147 | -0.260 | -0.405 | -0.123 | -0.338 | -0.360 | 0.004 | 0.258 | -0.109 | -0.252 | -0.293 | 1.000 | 0.155 | -0.150 | -0.281 | 0.086 | 0.271 | 0.175 | 0.349 | -0.209 |
| Platelets | -0.097 | -0.096 | 0.125 | 0.052 | -0.183 | -0.168 | 0.124 | -0.126 | -0.125 | 0.020 | 0.177 | -0.070 | -0.178 | -0.200 | 0.155 | 1.000 | -0.179 | -0.036 | 0.057 | 0.212 | 0.134 | 0.172 | -0.013 |
| Prothrombin | 0.134 | 0.133 | -0.167 | 0.092 | 0.264 | 0.268 | -0.050 | 0.209 | 0.157 | 0.028 | -0.297 | 0.150 | 0.264 | 0.251 | -0.150 | -0.179 | 1.000 | 0.134 | 0.089 | 0.311 | 0.198 | 0.301 | 0.008 |
| SGOT | -0.037 | -0.037 | -0.220 | 0.427 | 0.126 | 0.499 | 0.347 | 0.438 | 0.040 | 0.041 | -0.114 | 0.054 | 0.105 | 0.231 | -0.281 | -0.036 | 0.134 | 1.000 | 0.076 | 0.191 | 0.093 | 0.242 | 0.186 |
| Sex | 0.146 | 0.148 | 0.060 | 0.036 | 0.033 | 0.106 | 0.049 | 0.179 | 0.169 | 0.043 | 0.051 | 0.070 | 0.000 | 0.065 | 0.086 | 0.057 | 0.089 | 0.076 | 1.000 | 0.024 | 0.038 | 0.130 | 0.084 |
| Spiders | 0.082 | 0.084 | 0.230 | 0.099 | 0.209 | 0.321 | 0.072 | 0.282 | 0.125 | 0.000 | 0.257 | 0.133 | 0.223 | 0.329 | 0.271 | 0.212 | 0.311 | 0.191 | 0.024 | 1.000 | 0.308 | 0.324 | 0.078 |
| Stage | 0.104 | 0.104 | 0.153 | 0.063 | 0.195 | 0.137 | 0.035 | 0.138 | 0.118 | 0.027 | 0.231 | 0.116 | 0.206 | 0.526 | 0.175 | 0.134 | 0.198 | 0.093 | 0.038 | 0.308 | 1.000 | 0.273 | 0.078 |
| Status | 0.173 | 0.183 | 0.223 | 0.171 | 0.276 | 0.348 | 0.156 | 0.324 | 0.205 | 0.022 | 0.328 | 0.171 | 0.286 | 0.396 | 0.349 | 0.172 | 0.301 | 0.242 | 0.130 | 0.324 | 0.273 | 1.000 | 0.194 |
| Tryglicerides | 0.021 | 0.021 | -0.113 | 0.195 | 0.110 | 0.316 | 0.332 | 0.341 | 0.078 | 0.073 | -0.071 | 0.021 | 0.079 | 0.181 | -0.209 | -0.013 | 0.008 | 0.186 | 0.084 | 0.078 | 0.078 | 0.194 | 1.000 |
| N_Days | Age | Bilirubin | Cholesterol | Albumin | Copper | Alk_Phos | SGOT | Tryglicerides | Platelets | Prothrombin | Status | Diagnosis_Date | Age_Years | Edema_N | Edema_S | Edema_Y | Drug | Sex | Ascites | Hepatomegaly | Spiders | Stage | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 999 | 21532 | 2.3 | 316.0 | 3.35 | 172.0 | 1601.0 | 179.80 | 63.0 | 394.0 | 9.7 | 2 | 20533 | 59 | 1.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 2.0 |
| 1 | 2574 | 19237 | 0.9 | 364.0 | 3.54 | 63.0 | 1440.0 | 134.85 | 88.0 | 361.0 | 11.0 | 0 | 16663 | 53 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 |
| 2 | 3428 | 13727 | 3.3 | 299.0 | 3.55 | 131.0 | 1029.0 | 119.35 | 50.0 | 199.0 | 11.7 | 2 | 10299 | 38 | 0.0 | 0.0 | 1.0 | 1.0 | 0.0 | 0.0 | 1.0 | 1.0 | 3.0 |
| 3 | 2576 | 18460 | 0.6 | 256.0 | 3.50 | 58.0 | 1653.0 | 71.30 | 96.0 | 269.0 | 10.7 | 0 | 15884 | 51 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 |
| 4 | 788 | 16658 | 1.1 | 346.0 | 3.65 | 63.0 | 1181.0 | 125.55 | 96.0 | 298.0 | 10.6 | 0 | 15870 | 46 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 3.0 |
| 5 | 703 | 19270 | 0.6 | 227.0 | 3.46 | 34.0 | 6456.2 | 60.63 | 68.0 | 213.0 | 11.5 | 2 | 18567 | 53 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 2.0 |
| 6 | 1300 | 17703 | 1.0 | 328.0 | 3.35 | 43.0 | 1677.0 | 137.95 | 90.0 | 291.0 | 9.8 | 0 | 16403 | 48 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 |
| 7 | 1615 | 21281 | 0.6 | 273.0 | 3.94 | 36.0 | 598.0 | 52.70 | 214.0 | 227.0 | 9.9 | 0 | 19666 | 58 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 2.0 |
| 8 | 2050 | 20684 | 0.7 | 360.0 | 3.65 | 72.0 | 3196.0 | 94.55 | 154.0 | 269.0 | 9.8 | 0 | 18634 | 57 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 |
| 9 | 2615 | 15009 | 0.9 | 478.0 | 3.60 | 39.0 | 1758.0 | 171.00 | 140.0 | 234.0 | 10.6 | 0 | 12394 | 41 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 |
| N_Days | Age | Bilirubin | Cholesterol | Albumin | Copper | Alk_Phos | SGOT | Tryglicerides | Platelets | Prothrombin | Status | Diagnosis_Date | Age_Years | Edema_N | Edema_S | Edema_Y | Drug | Sex | Ascites | Hepatomegaly | Spiders | Stage | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 7895 | 1433 | 14161 | 0.5 | 291.0 | 4.24 | 37.0 | 1065.0 | 85.25 | 195.0 | 201.0 | 10.6 | 0 | 12728 | 39 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 |
| 7896 | 1271 | 13806 | 0.6 | 328.0 | 3.95 | 31.0 | 663.0 | 52.70 | 166.0 | 344.0 | 10.4 | 0 | 12535 | 38 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 |
| 7897 | 1455 | 16898 | 3.4 | 279.0 | 3.53 | 143.0 | 671.0 | 113.15 | 72.0 | 151.0 | 9.8 | 0 | 15443 | 46 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 1.0 | 2.0 |
| 7898 | 77 | 19884 | 5.1 | 178.0 | 2.75 | 464.0 | 1020.0 | 120.90 | 118.0 | 80.0 | 12.3 | 2 | 19807 | 54 | 0.0 | 0.0 | 1.0 | 1.0 | 0.0 | 1.0 | 1.0 | 0.0 | 3.0 |
| 7899 | 1413 | 24622 | 1.3 | 262.0 | 3.73 | 65.0 | 2045.0 | 89.90 | 78.0 | 181.0 | 11.0 | 2 | 23209 | 67 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 |
| 7900 | 1166 | 16839 | 0.8 | 309.0 | 3.56 | 38.0 | 1629.0 | 79.05 | 224.0 | 344.0 | 9.9 | 0 | 15673 | 46 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 |
| 7901 | 1492 | 17031 | 0.9 | 260.0 | 3.43 | 62.0 | 1440.0 | 142.00 | 78.0 | 277.0 | 10.0 | 0 | 15539 | 47 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 3.0 |
| 7902 | 1576 | 25873 | 2.0 | 225.0 | 3.19 | 51.0 | 933.0 | 69.75 | 62.0 | 200.0 | 12.7 | 2 | 24297 | 71 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 |
| 7903 | 3584 | 22960 | 0.7 | 248.0 | 2.75 | 32.0 | 1003.0 | 57.35 | 118.0 | 221.0 | 10.6 | 2 | 19376 | 63 | 1.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 1.0 | 0.0 | 3.0 |
| 7904 | 1978 | 19237 | 0.7 | 256.0 | 3.23 | 22.0 | 645.0 | 74.40 | 85.0 | 336.0 | 10.3 | 0 | 17259 | 53 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 |